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Abstract 

We consider the ensemble of Nx N random symmetric matrices {An, p } 
with independent entries such that An. p has, in average, p non-zero ele- 
ments per row. We study the asymptotic behaviour of the maximal (in 
magnitude) eigenvalue A max (iV) of matrices An, p /^/p when they are large 
and sparse, i.e. in the limit N,p — * oo, p — o(N). We prove that the value 
p c ~ log TV is the critical one for A ma x(oo) to be bounded. 

Our arguments are based on the calculus of the tree-type graphs for 
random matrices with independent entries. Asymptotic properties of 
sparse random matrices essentially depend on the typical degree of a tree 
vertex that we prove to be finite. 
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1 Introduction 

The study of eigenvalue distribution of iV-dimensional random matrices in the 
limit N — > oo has been initiated by E.Wigner jL4j. He considered the ensemble 
of real symmetric matrices 

A N (x,y) = a(x,y), x,y=l,...,N (1.1) 
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where {a(x, y), x < y, x, y € N} arc jointly independent random variables with 
zero mean and variance 1. The celebrated semicircle (or Wigner) law states 
that the distribution function of eigenvalues \[ N ^ < . . . < of the matrix 



An = An /yN given by relation 

a(X;A N ) = #{j: Af> < A}^ 1 (1.2) 
converges in the average as N — > oo to the nonrandom distribution a w (A) 



, (x) _ \ (2 7 r)- 1 V4^ ) if |A| < 2 , , 

awW -\0, if|A|>2 (L3) 

provided all even moments of a(x, y) exist and the odd ones vanish. 

Wigner introduced random matrices (1.1) to model distribution of energy 
levels of heavy atomic nuclei. Such a nucleus consists of large number (N ~ 100) 
of particles that interact each with other. According to this, the statistical 
description naturally presumes consideration of random matrices whose entries 
are random variables of the same order of magnitude (see e.g. |IT[ ). 

Recent applications of random matrices lead to various generalizations of the 
Wigner ensemble {An}- In particular, in the neural network theory the dilute 
version of (1.1) is used to describe the system, where some randomly chosen 
links between elements are broken (see e.g. [Q). Such a dilute matrix can be 
determined as 

A N<d (x,y) = a(x,y)d N (x 1 y), d N (x,y) = d N (y,x), 

where = {dN{x,y), x < y, x,y = 1, ...,JV} is a family of independent 
random variables, also independent from {a(x,y)}, with the distribution 

d N {x,y) = {]: Wi J Pr °™ y ^ /7V; Mr P<N. (1.4) 
v ,y; \0, with probability 1 - p/N; y ~ y J 

It is not hard to see that the An 4 has, in average, p non-zero elements per 
row. Thus, {Ana} could be regarded as the ensemble of matrices intermediate 
between the discrete version of random Schroedinger operator and the Wigner 
random matrices. This explains the interest to dilute random matrices from the 
spectral theory point of view (see e.g. ||). 

The asymptotic properties of the eigenvalue distribution of A^ were con- 
sidered in different aspects jj], [l(], In particular, it was shown in |Q and 
proved in |j) that the eigenvalue distribution of the matrix 

a( n( x 'V) = a i x iV)dN{x,y) : d N (x,y) = —d N (x,y) (1.5) 

Vp 

converges in probability in the limit N, p —> 00 to the semicircle distribution 
ff«(A) (1.3); 

p- lim cr(A; Aff) = <r w {\) (1.6) 
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that means that the semicircle law holds for dilute random matrices. 

The eigenvalue distribution function concerns a fraction of eigenvalues of the 
matrix. In applications it is often important to know whether the norm of A$ 

is bounded (see e.g. ||), i.e. whether there are eigenvalues of A^ outside of 
the support of a' w {X) in the limit N, p — > oo. 

In present paper we consider the probability distribution of the spectral norm 

H^HmaxilA^I, |A^>|}=A&ri 

where A^ p ^ < . . . < X^'^ are eigenvalues of A$ . We study the asymptotic 
behaviour of ||A^|| when matrices A$ are large and sparse, i.e. in the limit 
N,p — > oo, p — o(N). Our results show that the rate p c = log AT is the critical 
one for the limit of ||A^|| to be either bounded (in this case it is equal to 2) or 
unbounded. 



2 Main results and scheme of the proof 

First of all, let us note that one can determine the collection of random variables 
A = {An, N £ N} on the same probability space with the help of the 
procedure due to C.Newman (see e.g. We also determine {a(x, y), x,y € 
N} on the same ft a . 

We denote the measure and mathematical expectation corresponding to A 
by fid and angles (•), respectively. We also denote by E{-} the mathematical 
expectation with respect to the measure fi a generated by the random variables 
{a(x,y), x,y € N}. 

We assume that the random variables a{x, y) satisfy conditions: 

E[a(x,y)] 2fe+1 = Vx,y,fceN, (2.1a) 

and 

E[a(x, y)f = 1, sup E[a(x, y)] 2k = V 2k < oo Vfc S N. (2.16) 
Theorem 2.1 

Assume that the random variables a(x,y) are such that there exists S > 
such that 

V 2k < k Sk Vfc e N. (2.2) 
Then in the limit N, p — > oo such that p satisfies condition 

P 

(log N) 1+s ' 

with 5' > 25 the norm \\Aff\l is bounded; 

p- hm \\A$\\=2. (2.3) 

Jv ,p— *oo 
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Remarks. 

1. In fact, we prove that under conditions of theorem 2.1 the estimate 

Ma (g) Md {w: P^ll >2(l + 2e)}<exp{-ip N (s,S)logN} (2.4) 

holds with (pn(eS) — 0((logN) s ). This implies boundedness of lim ||A^|| with 
probability 1. This fact together with (1.6) implies (2.3). 

2. If there exists such U that V 2k < U 2k , then (2.4) and (2.3) hold for all 
5' > 0. 

To show that condition p ^> (\ogN) 1+s is necessary for convergence (2.3), 
we consider the simplest case of Bernoulli random variables 

A , , _ J 1, with probability 1/2; 
«*>V>-\-l, with probability 1/2. 

Theorem 2.2 

Let N, p increase infinitely in the way that 

P — - (2.5) 



(log N) 

for any positive 5 > 1 . Then for any given R > 

lim HaSwiw- W^nW > -K} = 1 > ( 2 -6) 

N ,p — >oo 

where A£ is given by (1.4), (1-5) with a{x,y) replaced by a(x,y). 

Now we describe the scheme of the proof of Theorem 2.1. First of all, let us 
recall that the semicircle law proved in Q] states that the moments 

Mf° = E J X 3 da(X; A N ) = E^ Tr ti N (2.7) 

converges in the limit N — > oo to the numbers 

X^da w (X), irij 



t k , for j = 2k; 
0, forj = 2fc + l, 



where 

(2fc)! 
k k\(k + l)V 
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Regarding the average 



Tr ti N = - 72 E E a(z, f Oo(Wi. 2/2) • • • afcy-i, or) (2.9) 

a; — 1 yi — 1 

as a sum over paths W(x, = (j/o = x,yi,y2, ■ ■ ■ , Uj-i,yo) of j steps, Wigner 

observed that the leading contribution to (2.7) is given by the paths that are 
related with the simple walks in the upper half-plane started and ended at zero. 
Basing on this fact, he derived recurrent relations for the limiting moments 

fc-i 

£ fc = ^tfc_i-;ii, to = l. (2.10) 

1=0 

In paper [|| the paths W(x, ij-i) were encoded with the help of simple 
connected graphs with no cycles. More precisely, it was shown that the terms of 
leading contribution to (2.9) can be related with the set Tk of one rooted trees 
with k edges drawn in the upper half-plane. This representation is equivalent 
to the simple walks description of |L4| and the number \Tk\ of elements in Tk is 
given by tk- Introducing some additional encoding for the trajectories W that 
provide vanishing contribution, the authors have studied asymptotic behaviour 
of the moments M^' for all j <C N 1 ^ 6 , N — > 00 and proved boundedness of the 

limit of ||Ajv||. These results were recently improved in paper Jl3||, where the 
case of j -C TV 2 / 3 has been investigated. 

Also let us note the paper where the necessary and sufficient condition 
for finiteness of limjv-^oo || Aiv|| are found. These conditions are rather weak and 
require existence of the fourth moment of the random variables a(x,y). 

The methods developed in and |D| are different from that used in || . It 
is not clear whether they are applicable to the dilute random matrices or not. 
In present paper we follow the way of juj and || and develop it to be applied 
for the Wigner random matrices and their dilute versions as well. 

There are three principal observations that we are based on: 

(i) all paths W(x, Y%k— 1) with non-zero contribution can be separated uniquely 
into classes II(t) corresponding to the trees r £ Tk; 

(ii) the paths that fall into the same class II(t) are described by graphs j(t) 
that are obtained from r by gluing its vertices; 

(iii) the contribution of the path is given by the number of cycles in 7. 
These facts allow one to estimate easily the number of graphs 7 and to sum 

the corresponding contributions. 

In Section 3 we give precise account on the graph representation of paths in 
the sum 

M j N *~> = E(7V- 1 Tr [A$Y) = ^£ E ^< Y2k-x)(D N (x, Y 2k -x)) (2.11) 
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where 

A(x,Yj-i) = a(x,yi)a(yi;y 2 ) ■ ■■a(yj-i,x) 

and 

D N (x,Y 2k -i) = d N (x,y 1 ) ■ ■■dN{Vj-x,x). 

It follows from this graphical description that the terms of (2.11) that are of the 
order 0(p~ s ) are described by the graphs 7 that have cycles only of the length 
2. We show this in Section 4. 

In Section 5 we estimate the number of trees that can produce the cycles of 
the length 2. Basing on relation (2.10), we develop a kind of the tree calculus and 
show that the number of trees having vertices of large degree is exponentially 
small with respect to the total number of trees. 

In Section 6 we prove our main technical estimate 

r,s=l P 

with 8k — * as k — > 00. This inequality allows us to prove theorem 2.1. Theorem 
2.2 is proved in Section 7. 

We summarize our results and compare them with certain known facts from 
probability theory and related fields in Section 8. However, let us briefly discuss 
here one consequence of our results with respect to the Wigner random matrices 
that correspond to the case of p = N in (1.5). It follows from (2.12) with p = N 
that one can assume that moments ~E\a(x,y)\ 2m exists for m < mo and obtain 
boundedness of for all k <C N a , where a depends on mo. For example, 

using a version of standard truncation procedure (see e.g. a version presented 
in ||), one can easily show that that a = 1/4 requires mo = 8. This indicates a 
bridge between results of |!| and fl3|| . Relation (2.12) explains why the higher 
moments of the matrix entries a(x,y) disappear from the limiting expressions 
for M 2 fe. 



3 Even partitions and trees 

It is clear that conditions (2.1a) imply that the average ~EA(x, Yj-i) is non-zero 
if and only if the path W(x, ij-i) is even p3| , i.e. if each step (yi,yi+i) counted 
both in direct and inverse directions appears in W even number of times. This 
immediately lead to the conclusion that M}^$ = 0. 

In the case when j = 2k, an even path uniquely determines a partition ir of 
variables {yj, i = 0, . . . , 2k — 1} into groups. We call such a partition the even 
one. The following statement links even partitions and rooted trees drawn in 
the upper half-plane. Let us note that given such a tree, one can order its edges. 
We adopt that the edges arc enumerated from below and from the left. In other 
words, we say that the edge e is less than the edge e' when e is situated on the 
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path from the root to e' or when there is a vertex v such that the path from v 
to e is to the left with respect to the path from 1/ to e'. 

Proposition 3.1. 

The set H' 2k of all even partitions of the variables (jjo,yi, ■ ■ .y,2k-i) can be 
uniquely separated into classes II(t) labeled by the trees r £ Tfc. 

Proof. We are going to show that given a partition tt 6 U' 2k , one can create a 
graph g(Tr) that is uniquely determined by a tree r. We construct g(rr) with the 
help of natural procedure that simplifies approach suggested in j^] and resembles 
certain arguments of |^3) . 

The first step is to determine the number of different groups of variables in tt. 
This coincides with the number of vertices in g(ir). We refer to the vertex that 
corresponds to the group containing y as to the root. Edges of g(jr) correspond 
to steps Si = (yi,y i+ i), i = 0, . . . , 2k - 1. 

Next, one starts to go along W — (yo, ■ ■ ■ , V2k-i,yo) and draws the edges of 
g starting from the root and joining subsequently the vertices according to the 
order dictated by appearance of steps Si. One obtains closed connected graph 
with directed and ordered edges. 

Let us consider two vertices joined by one or more edges. Since all edges 
of g are ordered, the edges between two vertices are also ordered and can be 
numerated between themselves. We call the edge marked if it has an odd number 
in this inner enumeration. It is not hard to see that there are exactly k marked 
edges in g. 

Finally one has to go once more along g and enumerate the marked edges 
according to the order of their appearance in g. Regarding the marked edges 
only, one obtains the connected graph 7 with k numerated edges and one root. 
Let us call it the first part of the walk g. The unmarked steps of g make also 
connected graph with k edges that we call the second part of g. 

The graph 7 has one root and k numerated edges. One can uniquely restore 
the tree r excepting the cases when the edges i and i + 1 in 7 have both vertices 
common. In this case we accept that r is such that these vertices are consecutive. 

Proposition is proved. □ 

Let us note in conclusion that graph j(t) is obtained from r by gluing q > 
vertices among them. One can regard this as if 7 obtained by a procedure of q 
steps where on each step just one pair of vertices is glued. 

Also it should be stressed that there exist partitions tt ^ tt' that lead to 
the same graph 7(r). The difference between tt and tt' is that they may have 
different second parts of g and g' . This happens only when in 7 there are cycles 
of the length more than 2 with edges "correctly" enumerated, i.e. one can go 
along the the cycle and see edges with increasing numbers prescribed for the 
edges of r. We call these cycles as the correct loops. In all other cases the 
second part of g is uniquely reconstructed from 7. We give precise account on 
this property in Section 4. 
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4 Number and contributions of partitions 



It is obvious that the value of ~EA(x,Y 2 k-i) as well as that of (D N (x,Y 2 k-i)) 
does not change if one preserves the partition and moves variables y,. Therefore 
one can write relation 

M^' P) = £ BW, B(tt) = l^^EA^.rafc-iXDjv^rafc-i)) (4.1) 
7ren' fc x,j/i 

and Sx,y 4 denotes the sum taken over variables x,y±,..., y 2 k-i in the way such 
that the partition tx is preserved. 

Tree representation helps to compute the contribution provided by the sum 
over given even partition S'"'. This contribution depends on the form of the 
graph 7(7") that corresponds to tt. 

Let us introduce several terms. Assume that in a tree r there is a vertex v 
of degree m > 2. The set of edges adjacent to v we call the cluster of the power 
m. If two vertices of the edges belonging to the same cluster are glued, then we 
say that there is the cluster gluing in the tree r. The other gluings are called 
the ordinary ones. 

Proposition 4.1. 

Assume that 7(7-) is obtained from t with the help of q = r + s gluings, where 
s is the number of cluster gluings. Then 

BM = B( 7 )< J ^-f[V i , i . (4.2) 

i—l 

Remark. We say that the graph 7 with r + s gluings and corresponding 
partition tt are of the type (r, s) . 

The proof of Proposition 4.1 can be easily derived from two observations. 

The first one is that the number of vertices in 7 is equal to the number of 
groups of variables in tt. Thus there are (N — 1)(N — 2) • • • (N — (k — q + 1)) 
terms in the sum T,^> of (4.1). 

The second observation follows from the definitions (1.4) and (1.5). It is 
easy to see that if in 7 there exist two vertices /1 and v such that there are £ > 1 
edges (/j,, 1/), then such a multiple junction provides the factor Ea 2 ^(d^) = 
\ / 2£N~ 1 p 1 ~£ . If £ = 1, then the factor due to this simple junction is obviously 
1/N. 

It is clear that if there are s gluings in one cluster, then one obtains I multiple 
junctions with multiplicities £1, . . . , & such that £1 + ...£; = s + I. Taking into 
account elementary inequalities 

V26 • • • V2& < V r 2 ^ 1+ ...2C ; = V 2 ( s+ l) < V 4s , 
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one arrives at (4.2). □. 



Proposition 4.1 shows that all the partitions of the type (r, s) provide the 
same contribution. Now it remains to estimate their number. This number can 
be estimated by the number Af(r, s) of possibilities to make r + s gluings in the 
tree r. We also should multiply AT(r, s) by the number of different partitions 
that correspond to the same graph 7 of the type (r, s). 

Let us note here that the cluster gluings are independent from the other 
gluings in the sense that the number of possibilities to make the cluster gluing 
in 7 does not depend on the number of other gluings already performed. This 
observation together with (4.2) shows that one can treat these two types of 
gluings separately. 

It is not hard to see that given a graph j(t) of type (o, s), corresponding 
partition tt can be uniquely restored. This is because the way along 7 preserving 
the existent order can be performed uniquely. In other words, the first and the 
second parts of the graph g are uniquely determined by 7. Thus, the number 
of partitions of the type (0, s) coincides with the number D s (t) of possibilities 
to make s cluster gluings in r. 

The picture differs for the ordinary gluings leading to the cycles of the length 
greater than 2. Assume that there is a vertex v with m > 3 edges a and there 
is a cycle of the length I > 3 starting and ending at v. If this is the correct loop, 
then one can reconstruct the second part of the graph g in several ways because 
one can pass the cycle for the second time between passing the edges e^. Also 
each loop can be passed in two opposite directions and this makes double the 
total number of possible ways. 

Now let us estimate the number of partitions of the type (r, s) performed 
in a tree r that has / clusters of powers mi, . . . ,mj, respectively. Clearly, k < 
mi + . . . mi < 2k — 2. Let us assume that there are s cluster gluings such that 
in cluster i there are Sj gluings. Then, summarizing the arguments presented 
above, one can write that 



In this formula D T (r) is the number of possibilities to make r ordinary gluings in 
the tree t, Af T ({Ri}) is the number of different partitions that can be obtained 
from the graph 7(7-) due to presence of correct loops, and D T ({Si}) is the 
number of possibilities to make s cluster gluings in r. Here we have denoted 
Ri = (ri,r 2 , ■ ■ ■ ,n), and Si = (s 1 , s 2 , . . . , s;)> sucn that r i + r 2 + • • • + n = r. 
Obviously, s\ + S2 + ■ ■ ■ + si = s. Summations go over all possible combinations 
of n > and s, > satisfying conditions presented above. 




(4.3) 
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Elementary calculation shows that 

1 fk\ (k - 1\ (k - r + 1\ k 2r , , 

Next, assuming that all cycles obtained are correct, we can write that 

^ r)s ± Kmi) <±fl^i^L. (1 . 5) 

{Ri} W}*=1 V 11 

One can obtain the latter expression regarding the observation that if in a 
cluster i there are n correct loops, then the number of different ways to pass 
them is estimated by the number of possibilities to distribute different balls 
into rrii — boxes. We easily derive from (4.5) that 

r l r I r ■ 

K(r) < e n rt < ^ r e h n 7T ^ ( mi + • • • + m; ) r = ( 4fc ) r - ( 4 - e ) 

{K,}*=1 {Hi} *=1 

Let us turn to the number of cluster gluings. ft is estimated by the product 

Taking into account inequality Vi Sl ■ ■ ■ Vi Sl < Vi Sl we can write that 

!>({M)lK < ( ^ + -| + TO ' ) V 4s . (4.7) 

Trivial inequality for the variable Ejj^ (r) = m\ + . . . < 4/c 2 lead to the 
estimate k 2s /p s . This estimate is rather rough and is not sufficient for the proof 
of theorem 2.1. In the next section we show that ^ T E^(t) behaves like t k k 1+s 
that reflects the almost linear character of the average tree. This fact together 
with inequalities (4.5)-(4.7) implies (2.12). 

5 Enumeration of trees 

In this section we consider the set T k the one-root trees r having k edges drawn 
in the upper half-plane. Let us recall that given a tree t e Tk, one can order its 
edges. Let us also give several definitions. 

We refer to the edges adjacent to the root as to the root edges. We define 
the cluster n as the set of m edges that have one common vertex v. We call m 
and v the power and the center, respectively, of the cluster k. 
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The main result of this section is given by the following statement. 



Theorem 5.1. 

Let us consider the set G k m ^ C Xfc of trees that have at least one cluster 
with the power m > 2. Then the number of such trees \G k m ^\ = <7fc(m) satisfies 
relation 



9k(m) < kt k ^-j . (5.1) 

To prove theorem 5.1, we need the following auxiliary statement. 
Lemma 5.1. 

Let us consider the set of trees that have m > 2 root edges. Then 

(o\ m-2 

Proof. It is easy to see that 

k—m 

t k — t ai t a2 ■ ■ ■ t am , (5-3) 

a i 

where the sum is taken over all cti > such that ol\ + . . . + a m = k — m. 
Let us derive first the simple estimate 

4 m) < tfc-i- (5-4) 
Regarding (2.10), we can rewrite (5.3) in the form 

k—m k—m—q q 

(m) _VVii---i Vi t - 

L k / / / j L a! L a 2 L a m -2 / , v a m -i v a m 

q—0 cti cti 
k — m k — m — q 

^ ] ^ ' taita 2 ' ' ' ta m _ 2 tq+l- (5-5) 
q—0 cti 

The latter sum can be regarded as the sum over m — 1 variables on > 0, 
where the term with a m -i — is absent. Then 

k—m+l k—m+1 

Am) _ , , , _ , , , = Am-1) _ .(m-2) 

L k ~ / j L a 1 l 'a2 °a m -i / J '-oi^ °a m -2 — L k k 

cti cti 

Thus, 4 m) < 4 2) - Relation (2.10) implies equality t[ 2) = t k -i that gives (4.4). 
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Let us rewrite (5.3) in the form 

k— m 

4 m) - E fe-iv (5.6) 

q=0 

If m — 1 > 2, then we can apply (4.4) to and obtain that 

k — m 

4 m) < E = 

ifc-i — (ioife-2 + . . . + t TO _iffe_ m _i) < ife-i — ifc-2- 

Expression (2.8) for implies that t/j_2 > ifc-i/4. Therefore < 3ffc_i/4. 

If m — 2 is greater than 2, then we can substitute the last inequality into 
(5.6) and obtain that t k ™^ < ifc_i(3/4) 2 . Now it is clear that (5.2) is true. 

Using similar computations, one can easily prove the following statement. 

Lemma 5.2. 

Let us denote by n r (s) the number of trees that can be constructed on r > 1 
roots with the help of s edges. Then for 2 < I < s 

n r+ i(s - 1) < n r+1 {s - 1) <n r (s)(^j . (5.7) 

Proof theorem 5. 1 . 

We start with the observation that one can construct a tree t e T k from 
the set Ek of k edges ei that are already enumerated on the way that this 
enumeration will agree with the order among tree edges. 

Now assume that before this procedure the edges e^, e^+i, . . . , e^+m are 
joined to the same vertex and used in the construction as one cluster. According 
to Lemma 4.2, this diminishes exponentially with respect to tk the number of 
trees obtained on this way. Taking into account that h can vary from 1 to k — in 
we obtain the estimate (5.1).D 

In conclusion let us note that (5.1) implies that the number of trees that 
have the power of the maximal cluster large enough is exponentially small with 
respect to tk- Therefore the number of such clusters is exponentially small with 
the total number of clusters in trees r <G Tk. This means that the average degree 
of a tree vertex remains finite even in the limit k — > oo. We plan to study this 
problem mores systematically in a separate publication. 
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6 Proof of theorem 2.1. 

As it follows from (5.3) and (5.7), we need to estimate the sum 

8=1 - P T £T k 

where m, are the powers of clusters of r. Let us denote by m(r) the maximal 
cluster power of the tree r; then obviously 

(ml + ■ ■ ■ + mfy < HT) s (mi + ■■■ + mf = (2k) s E m(r) s . 

TeT k T£T k T<=T k 

Let us consider the set of trees T' k such that rh T < xlogfc, where x = 
(log 4- logS)" 1 . Then 

(2k) s ^ < t k (2k X \ogk) s < t k (2 X k) s(1+ ^ 

with some 9 = 6 k that can be taken vanishing as k — > oo. 

We represent the set T k \ T' k as the sum of sets G k {[x(j + logfe)]) of trees 
that have the maximal cluster power equal to [x{j + logfc)], j = 1, 2, . . .. Here 
we denoted by [x] the maximal natural number less than or equal to x. Using 
estimate (5.1), we can write that 

fc— [x lo s fc l 

E^< E E <^ 

tgt^ i=i T-eGfeO+lx logfe]) 

fe 

e 2 ^x s (logfc + i) s exp{-i-logfc}fci fe < e 2 (s + l)!( X logfc) s . 

i=i 

Thus, we obtain inequality (cf. (2.12)) 

Qk(s) <p- s (/3fc) s(1+9 'V 4s , (6.1) 

where /3 is a constant and 0' can be chosen vanishing when k — > oo. 

Now it is easy to derive the estimate (2.4) from (2.12). Taking into account 
(6.1) and the estimate V± k < (2fc) 2<5fc , we obtain inequality 

M<f< p) < t fc ((1 - ak 3 N- 1 m 1+2S + 9 'p- 1 ))^ . 
Then we can deduce that inequality 

M£' p) <et k < (l+e) 2k A k (6.2) 



13 



holds for all k sufficiently large and such that k <C iV 1 / 3 and k 1+s+e <C p. 
Regarding definition (1.2), we can write that 

M 2 ( f' p) < E( / X 2k d<r(X;A^)) > 4 fc (l+2 £ ) 2fe 7VE(#{|A^ p) | > 2(l+e)}) > 

J\\\>2(l+2e) 

4 fe (l + 2e) 2k Nfi a ® ^{cj : ||A^|| > 2(1 + 2e)}. 
Combining these inequalities with (6.2), we derive that 

/ 1 , \ 2fe 

/ i ®Mw:||A^||>2(l + 2 e )}<JV^ T ^J . 

Choosing fc such that k/\ogN — > oo, we obtain that (2.4) holds. Theorem 2.1 
is proved. 

7 Proof of theorem 2.2 

Let us consider the unit vectors e^) with the components 
We can write that 

2 

ll^pll 2 > max i= i j ... jA r A N ^ J) = max j=i,...,jv (^jv, p ) (7-1) 
where 

K, P ) (i,i) = El^, P (i,y)| 2 . 

Let us introduce random variables 

AT 



^ w) = EIV p (m)I 2 , i = i,...,JV. 

It is clear that the family {^j^} is the set of jointly independent random 
= WV (^.p) (j ' j) " .. Ul:,x v ^ • ( 7 - 2 ) 



variables and that 

iiiaji._ysi N _„) \J,J) ^ inaA / 

J 

Let us note that under conditions of theorem 2.2 

N 



^E^)^- (7.3) 



V=] 
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Let us consider the probability distribution of the random variable 



ffW = max hf\ 

j<N/2 J 



It is clear that 



P N (R) = Pi{H N < R} = Pr <j max nf° < pR 

\j<N/2 13 



N/2 



l[(l-Pr{ V ^> P R} 



It is not hard to prove that the random variables considered for p ~ log N 
converge as N — ► 00 to the Poissonian random variables that are identically 
distributed with the parameter p. Then we can write that 



1 - Pr \rlf ] > pi?} < 1 - Pr {Q = pi?} = 1 - 



P 



pR 



(pR)\ 



Let us show that if p = (log TV) 1 S , then pP R [e p (pi?)!] 1 decays more slowly 
than 2/N and 



(pi?)! 



as N — > 00. 



(7.4) 



Using the Stirling formula, we can write that 



pP R gP(fl-l) 



(pi?)! 



i?P« 



exp < —pi? 



logi?- 



r - 1 



It is easy to see that (7.4) holds that together with (7.1) and (7.2) proves relation 
(2.6). 



8 Summary and discussion 

We consider the ensemble of random matrices An, p with independent entries 
such that An,p has, in the average p non-zero entries per row. We study asymp- 
totic behaviour of the spectral norm ||Aat iP || in the limit when N, p — > 00 and 
p = o(N). We consider the averaged moments 

1 N 

M ^ P) = n £ £ E<^(x, yi) • • • A NtP (y 2k - U x)) (8.1) 

and give a further development to the approach proposed in papers pi |l4| |. 
The method is based on the relation between the even partitions ir of variables 
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(x, j/i, . . . ,j/2fe-i), i-e. those that make the average in (8.1) non-zero, from one 
hand and the rooted trees t E Tk constructed in the upper half-plane with the 
help of fc edges, from the other hand. 

We give the full description for the even partitions 7r in terms of the graphs 
7 obtained from a tree r by gluing its vertices. The magnitude of the terms 
described by partition 7r is determined by the number of cycles in corresponding 
graph 7. This allows one to estimate easily the number of different partitions 
and their contributions to . 

We show that the terms of the order p~ l M^u in the limit p = o(N), N — ► 
00 comes from those partitions it' that are encoded by the graphs 7' that have 
cycles of the length 2 exactly. This means that the average degree of the vertex 
plays the key role in asymptotic properties of large sparse random matrices. 

We obtain the estimate showing that the average in certain sense tree is 
of the linear structure and the relative number of possible 2-cycles increases 
almost linearly with respect to k — > 00. This allows us to derive our main 
result that ||Ajv, p || remains bounded when p ~ (log N) 1+5 , 5 > 0. We show 
that the value p c = log N is the critical one because ||Ajv iP || is unbounded when 
p~ (logjV) 1 " 5 . 

Our results can be compared with the classical Erdos-Renyi limit theorem 
H . This statement concerns independent random variables £1 , • • • , £jv and the 
partial sums Xj >p = £j + £i+i + • • • + £i+p- It is proved that the value p c = log N 
is the critical one for the X(N) = maxi Xi^p -1 to be either bounded or not. 

One can regard our theorems 2.1 and 2.2 as a limit theorems for stochastic 
versions of Erdos-Renyi partial sums. This claim is supported by the fact that 
random variables hj N ^ (5.3) are determined as the sum of approximately p 
random variables. Let us also note that theorem 2.1 concerns a random variable 
||Azv,p|| that is greater than X(N) = max.,- || ^4 iv,p<s^ J ' ) 1 1 - This could explain the 
need of conditions (2.2) that are more restrictive than the standard conditions 
of Erdos-Renyi limit theorem. 

One can also trace out more subtle link between our results and the well- 
known statement concerning the connectedness of random graphs (see e.g.||). If 
one has N vertices and draws q edges joining randomly chosen vertices, then the 
graph obtained will be connected in the limit N — > 00 provided q N log N. In 
our terms the randomly chosen pairs to be joined correspond to non-zero 
elements in the dilute matrix A^, P - In this context, the transition from discon- 
nected graph to the connected one can be regarded as analog of the transition 
of Ajv, p from the class of the tridiagonal matrices (like discrete Schrodinger 
operator) to the class of Wigner random matrices. 

It should be noted that in mathematical physics literature another critical 
values are found for sparse random matrices. In particular, in paper [ [l0| it 
is claimed that certain "density-density" correlation function changes its be- 
haviour at finite values of p. Numerical studies show that certain spectral 
characteristics of strongly dilute random matrices can depend on finite values of 
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p. Therefore our results imply that there can be several different critical values 
in the sparse random matrix model. 

To complete the discussion, let us note that the dilute random matrices 
An,p with p replaced by N take the form of the Wigner random matrices A^. 
Therefore technique developed in present paper can be also useful in this case. 
This topic is already addressed in Section 2. 
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